Determining Hit Rate in Pattern Search

نویسندگان

  • Richard J. Bolton
  • David J. Hand
  • Niall M. Adams
چکیده

The problem of spurious apparent patterns arising by chance is a fundamental one for pattern detection. Classical approaches, based on adjustments such as the Bonferroni procedure, are arguably not appropriate in a data mining context. Instead, methods based on the false discovery rate the proportion of flagged patterns which do not represent an underlying reality may be more relevant. We describe such procedures and illustrate their application on a marketing dataset.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Heparin induced thrombocytopenia

Abstract Background and Objectives Heparin is still a commonly used anticoagulant in prophylaxis and treatment of thromboembolic events. Heparin-induced thrombocytopenia (HIT) is a life-threating adverse drug reaction of heparin. The diagnosis of HIT is made based on two important criteria, firstly clinical evaluation and secondly laboratory testing. In this comprehensive review, the authors w...

متن کامل

Journal of Clinical and Diagnostic Research

Various algorithms are in use in medical processes to improve the speed, sensitivity and accuracy of the computations and analyses involved in those experiments. The aim of this paper is to suggest three improvements, namely Multi Hit, Dropoff percentage and NCM-2 in the BLAST algorithm. BLAST (Basic Local Alignment Search Tool) is a popular tool used for determining the patterns in genomic seq...

متن کامل

Evaluation of Ontology-based User Interests Modeling

Deriving users’ interests from their online searching and browsing behaviors is an important research direction with several applications in content search and management. Manually edited Web directories, such as Open Directory Project (ODP) or Yahoo! 2 directory, provide ontology of concepts (categories) along with pages relevant to those categories. Aiming to evaluate and compare the performa...

متن کامل

Aiming strategy error analysis and verification of a billiard training system

A low cost training system is proposed for regular billiard game tutoring. We describe the elements to construct an interactive computer system which helps train billiard players in enhancing their skills. Most research on computer billiard has focused on creating highly competitive billiard playing programs, based on various search algorithms. Game playing strategies are embedded into these pr...

متن کامل

A machine learning approach for result caching in web search engines

A commonly used technique for improving search engine performance is result caching. In result caching, precomputed results (e.g., URLs and snippets of best matching pages) of certain queries are stored in a fast-access storage. The future occurrences of a query whose results are already stored in the cache can be directly served by the result cache, eliminating the need to process the query us...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002